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U ■ This paper presents improved approximation algorithms for the problem of multiprocessor 

Q , scheduling under uncertainty (SUU), in which the execution of each job may fail probabilisti- 

cally. This problem is motivated by the increasing use of distributed computing to handle large, 
computationally intensive tasks. In the SUU problem we are given n unit-length jobs and m ma- 
chines, a directed acyclic graph G of precedence constraints among jobs, and unrelated failure 
probabilities t&j for each job j when executed on machine i for a single timestep. Our goal is to 
find a schedule that minimizes the expected makespan, which is the expected time at which all jobs 
complete. 

Lin and Rajaraman gave the first approximations for this NP-hard problem for the special cases 
of independent jobs, precedence constraints forming disjoint chains, and precedence constraints 
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forming trees. In this paper, we present asymptotically better approximation algorithms. In 
particular, we give an 0(log log(min{m, n}))-approximation for independent jobs (improving on 
the previously best O(logn)-approximation). We also give an 0(log(n + m) log log(min{m, n}))- 
approximation algorithm for precedence constraints that form disjoint chains (improving on the 

previously best O ^log(n) log(m) lo g°f ^+^ ) -approximation by a (log nj log log n) 2 factor when n = 

m®^ 1 ). Our algorithm for precedence constraints forming chains can also be used as a component 
for precedence constraints forming trees, yielding a similar improvement over the previously best 
algorithms for trees. 
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1 INTRODUCTION 



Our work concerns approximation algorithms for multiprocessor scheduling under uncertainty, first 
introduced in [12]. This model extends the classical construction of machine scheduling to handle 
cases where machines run jobs for discrete timesteps and succeed in processing them only proba- 
bilistically. Our motivation stems from the increasing use of distributed computing to handle large, 
computationally intensive tasks. Projects like Seti@Home [1] divide computations into smaller jobs 
of relatively uniform length, which are then executed on unreliable machines (e.g., of volunteers). 

Scheduling multiple machines to process the same job at once can help overcome the problem 
of unreliable machines, but many machines processing a single job can also slow down overall 
throughput. The situation is exacerbated when precedence constraints among jobs are present, 
which is often the case for sophisticated computations; here a single job failing may delay the start 
of many others. Note that the special case of having no precedence constraints retains practical 
significance. Google's MapReduce architecture [3], for example, generates jobs whose dependencies 
form a complete bipartite graph, which is equivalent to two phases of independent jobs. 

Motivated by these examples, we study the multiprocessor scheduling under uncertainty 
(SUU) problem. An SUU instance is comprised of a set of n unit-time jobs and a set of m machines. 
For each machine i and job j, we are given a failure probability qij, which is the chance that job 
j does not complete when run on machine i for a single timestep. Any precedence constraints 
are modeled as a directed acyclic graph (dag). Our objective is to construct a schedule assigning 
machines to eligible jobs at each timestep, minimizing the expected time until all jobs have suc- 
cessfully completed. In contrast to many other scheduling problems, SUU allows multiple machines 
to execute the same job in a single timestep. 

Related work 

Malewicz's initial presentation of SUU [12] includes a polynomial-time dynamic-programming so- 
lution for instances where both the number of machines and the width of the precedence dag are 
constant. If either of these constraints is relaxed, he proves that the problem becomes NP-Hard. 
Furthermore, when both constraints are removed, there is no polynomial-time approximation al- 
gorithm for the problem achieving an approximation ratio lower than 5/4, unless P = NP. This 
work does not include approximation algorithms for the general (NP-Hard) problem. 

Lin and Rajaraman present the first (and, to date, only) approximation algorithms for SUU [11]. 
Using a greedy approximation algorithm to maximize the chance of success across all jobs, they give 
an 0(log n)-approximation when all jobs are independent. More sophisticated techniques, including 
LP-rounding and random delay [9, 15], yield a variety of 0(poly log(n + m)) approximations when 
precedence graphs are constrained to form only disjoint chains, collections of in- or out-trees, 
and directed forests. For these settings, our algorithms improve their approximation ratios by a 
(log nj log log n) 2 factor, when n = m & ^\ See Table 1 for a complete comparison. 

The wider field of machine scheduling is an established and well-studied area of research with a 
large number of variations on its core theme (see [6] for a survey). There are three main differences 
between SUU and problems studied in the literature. First, in SUU, jobs may run on multiple 
machines in the same timestep. Second, each job has a chance of failing to complete on any 
machine that processes it. Third, jobs must be scheduled at unit granularity. 

Of deterministic scheduling problems, SUU most closely resembles R\prec, pmtn\C mSLX [7,9,15], 
the problem of preemptively scheduling jobs with precedence constraints on unrelated parallel 
machines so as to minimize makespan. Instead of failure probabilities q^, there is a deterministic 
processing time pij, denoting how long it takes for machine i to complete job j. In contrast to SUU, 
however, machines never fail, and jobs may only run on one machine at a time. As in [11], techniques 
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for this problem play an important role in our approximations. We also borrow techniques from 
"job-shop scheduling" [5] for our SUU algorithms for precedence constraints, but the particulars of 
that setting are not very similar to the ones we consider here. 

There is also a large body of work in stochastic scheduling (see [14, Part 2] for a representative 
sample). The majority of the work in this area considers how to schedule jobs whose input lengths 
are not known, but instead given as random variables distributed according to some probability 
distribution. Particular attention has been paid to the case when these distributions are restricted 
to exponential families, which is similar in some respects to SUU (see Appendix C). However, we 
know of no approximation algorithms, even with this restriction, when machines are unrelated. 

Our results 

We give improved approximation algorithms for SUU when jobs are independent (there are no 
precedence constraints), and when the precedence constraints form disjoint chains. For independent 
jobs, we give an 0(log log(min{m, n}))-approximation algorithm. One component of this algorithm 
is based on an LP relaxation. 

Our analysis for independent jobs relies on a competitive analysis [17]. Essentially, we show 
that our algorithm is 0(log(p max /p m i n ))-competitive for a deterministic scheduling problem (similar 
to R\pmtn\C maiX ) in which each machine has a deterministic speed, but processing times for jobs 
are chosen arbitrarily by an adversary, with minimum and maximum values p m i n and p max - This 
competitive result is interesting in its own right. 

When the precedence constraints form a collection of disjoint chains, we have an 0(log(n + 
m) log log(min{m, n}))-approximation algorithm. Our disjoint-chains algorithm uses an LP relax- 
ation similar to the one used for independent jobs. We also apply techniques from network-flow 
theory and prior work on the SUU problem for chains [11]. The loglog(min{rre, n}) factor arises 
from the independent-jobs algorithm — therefore improving that algorithm immediately yields a 
better algorithm for chains. 

Our algorithm for disjoint chains can be extended to yield an 0(log(n+m) log(n) log log(min{m, n}))- 
approximation for directed forests using the chain-decomposition techniques of [7, 1 1] . 

We also show how to apply our algorithms to similar variants in the problem of stochastic 
scheduling, where jobs have stochastic processing times. To the best of our knowledge, these are 
the first approximation algorithms for stochastic scheduling with the expected-completion-time 
objective and unrelated machines. 

Paper organization 

In Section 2 we give formal definitions of the SUU problem, the scheduling algorithms that we apply 
to it, and an equivalent formulation of SUU that plays an important role in our approximation 
algorithms. A full treatment of this reformulation is given in Appendix A. Section 3 presents our 
algorithms for independent jobs, and Section 4 shows how to extend them to handle precedence 
constraints forming chains. We defer tree-like precedence constraints and stochastic scheduling to 
Appendices B and C, respectively. 



2 



2 PRELIMINARIES 



In this section, we give a formal statement of our problem and then define what we mean by a 
schedule. Most of our notation is consistent with that of Malewicz [12] or Lin and Rajaraman [11]. 
We then present a reformulation of the SUU problem, which we use in subsequent sections to 
simplify both our algorithms and the analysis involved. 

The SUU problem 

An instance / = (J, M,{qij},G) of the SUU problem includes a set J of unit-step jobs and a set 
M of machines. Throughout this paper, we let n = \J\ be the number of jobs and m = \M\ be 
the number of machines. For each machine i and job j, we are given a failure probability q^, 
which is the probability that job j does not complete when run on machine i for one unit step; 
these probabilities are independent. Without loss of generality, we assume that for each job j, there 
exists a machine i such that qij < 1. 

An SUU instance also includes a set of precedence constraints comprising a directed acyclic 
graph (dag) G with jobs as vertices. We say that a job j is eligible for execution at time t if 
all jobs preceding j (i.e, jobs having a directed path to j) in the dag have successfully completed 
before time t. If a job j is eligible at time t, a schedule may assign multiple machines Mjj C M to 
execute j in parallel. As all machine/job failures are independent, the probability that j does not 
complete in that timestep is ILeA/ Qij- 

Failure probabilities are difficult to work with because they multiply. Instead, we define the log 
failure of job j on machine i, denoted by £ij, as £ij = — log q^. Here and throughout the paper, 
we use log to mean a base-2 log. Note that by definition, q^ = 1/2^', and hence ILeM t = 
l/2^ ieM i.*^. We define the log mass of an assignment to job j in step t be the sum of the log 
failures, given by J2ieMj t We also use log mass to refer to the sum of log masses across multiple 
timesteps; that is, the log mass accrued from step t\ through step ti is given by Ylt=t SieM 

Our work focuses on finding scheduling algorithms that minimize expected makespans for re- 
stricted classes of precedence constraints. If there are no precedence constraints, we say that the 
jobs are independent, and refer to the problem as SUU- 1 . When the precedence constraints form 
a collection of chains, we call the problem SUU-C . When the constraints form a collection of 
disjoint trees, we call the problem SUU-T . We use sans serif fonts to refer to problem variants, 
whereas serif fonts refer to algorithms / schedules for the problem. 

Schedules 

A schedule £ is a policy for assigning machines to (uncompleted) jobs. Jobs must be scheduled at 
a unit granularity, but the schedule may assign multiple machines to the same job. A schedule may 
base its decisions on any of its history, but we concern ourselves with only schedules that can be 
computed in polynomial time. More formally, a schedule is a function £ : (JJxN) — > (M — > JU{_L}) 
that, given a history 1 h G H and time t G N, returns a function assigning machines to jobs. We use 
the symbol _L to indicate that the machine remains idle. To allow for more concise schedules, the 
assignment function returned by T,(h,t) may map a machine to a job that has already completed. 

We define an execution of S as follows. Suppose that h £ H is the history of the execution up 
to time t. Then S assigns machine i to job j = Y±(h,t)(i) at step t. If j has been completed when 

X A full history for a deterministic schedule can be captured by the sets of remaining jobs at each timestep prior 
to the current timestep t. More formally, let Ht = {{Si, S2, ■ ■ ■ , St)\J = Si D S2 3 • • • 3 St} denote the set of all 
feasible ordered sets of remaining jobs at timesteps 1,2, ... ,t. Then valid histories are given by the set H = Ut^i Ht- 
Note that compact representations of the history exist, so a polynomially computable schedule may consider the 
entire history. 
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it is scheduled to run, i is assigned to _L. Since J, M, and G are invariant over a problem 

instance, we allow £ to reference those implicitly. 

Whenever the schedule S is such that it assigns machines to jobs depending only on the current 
time and the initial set of jobs, not the jobs that have completed (i.e., for all t, T,(h,t) = Ti(h',t) 
for all h, h' G H), we say that the schedule is oblivious. An oblivious schedule has finite length if 
it is only defined for t <t a , for some t Q . 

We say that a schedule is semioblivious if it can be decomposed into "rounds" such that 
the assignments within each round are characterized by finite oblivious schedules. Thus, while 
executing a step contained in a particular round, the assignment of machines to jobs depends only 
on the initial set of jobs when the round began and the number of steps the round has been running. 

We let Ts be a random variable denoting the length of the execution of schedule S, which is the 
number of steps before all jobs have completed. Our objective is to minimize E[2~y (denoted by 
E[C max ] in much of the scheduling literature). We refer to a schedule that has minimum expected 
makespan as Sopt, and its expected makespan, which is finite [12], as E[Topt]- For any SUU 
instance, Xopt exists, and can be computed (inefficiently) by selecting the assignment of jobs to 
machines on a particular timestep that minimizes the expected makespan of the remaining jobs. 

In this paper, we consider schedules that are polynomial-time computable (in n, m, and 
logEpopT]) and whose expected makespans approximate E[Topt]- We say that S is an a- 
approximation if E[Ts] < «E[Topt] for all choices of probabilities 

Throughout the remainder of this paper, we use algorithm and schedule interchangeably. More- 
over, we generally do not give the schedule explicitly as a function assigning machines to jobs. 
Instead, we describe it algorthmically. 

Problem reformulation 

We now describe a new, and equivalent formulation of the SUU problem, which we refer to as SUU*. 
Because of their equivalence, we refer to both problems as SUU later in the paper. A full treatment 
is presented in Appendix A. 

An SUU* instance / = (J, M, {qij}, G) has the same structure as an SUU instance. The difference 
is that rather than considering the success or failure of a job as it runs on machines in each timestep, 
we use the Principle of Deferred Decisions [13] to view the problem as one of deterministically 
scheduling jobs with randomly distributed lengths. 

Instead of failure probability, in SUU* we view = — log qij as an amount of "work" that 
a machine does towards a job completion in each unit timestep. As in SUU, machines must be 
scheduled at a unit granularity. At the start of a schedule's execution, we draw for each job j a 
single random variable rj chosen uniformly at random from the (0, 1) interval. A job j completes 
when the total work done (or, "log mass" accrued) on j exceeds — \ogrj. That is, j completes at 
the first step t in which Y%=i SigM ^ij — ~^°& r j- As schedules are oblivious to these rj, they 
behave the same way on SUU and SUU* instances. 

3 INDEPENDENT JOBS 

This section describes an 0(loglogn)-approximation algorithm for SUU-I, the SUU problem with 
independent jobs. We first give an oblivious 0(logra)-approximation algorithm for SUU-I, based on 
scheduling an (approximation of an) integer linear program. We then modify this algorithm into a 
semioblivious solution consisting of O (log log n) nearly optimal phases. 

An oblivious 0(log n)-approximation 

We now describe an 0(log n)-approximation for SUU-I, called SUU-I-OBL. Our approach constructs 
a schedule of length 0(E[Tqpt]), based on an integer linear program, such that each job has no more 
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than a constant probability of failure upon completion. This finite oblivious schedule is repeated 
until all jobs have completed. Using Chernoff bounds, we conclude that the expected number of 
repetitions is O(logn), yielding an 0(logn)-approximation. 

We use the following integer linear program for SUU-I-OBL. Let Xij denote the number of steps 
during which machine i is assigned to job j. Recall tij = — log qij is the log failure of job j on 
machine i. Let L be a fixed positive real, representing a target log mass for each job, and let J' C J 
be a subset of jobs that need to achieve that log mass. For now, think of L as being fixed at L = 1/2, 
and J' = J. We assign different values to L later for the semioblivious 0(loglogn)-approximation. 

(LP1) mint 

s -t- T,ieM £ ij x ij > L G J' (1) 

E je j x v < * ViGM (2) 

Xij e NU{0} Vi€M,j€J. (3) 

Here Equation (1) enforces that every job in J has a failure probability no greater than l/\/2, 
and Equation (3) guarantees that all jobs are scheduled for an integral number of steps on each 
machine. We use (LP1) to refer to this integer linear program generically, and LP1(J' , L) to refer 
to it with particular values of J' and L. We denote the optimal value for LP1(J' , L) by tipi(j',L)- 

A solution for LP1(J', L) naturally generalizes to a finite oblivious schedule, denoted by ^lpi(J',l)i 
with length tLpuj'n as follows. Consider a machine i, and consider each job j in arbitrary order. 
Assign machine i to job j for timesteps. To finish our description of this schedule, we first claim 
that i,LPi(j,i/2) approximates E[Topt] (the proof appears in Appendix D). Then we show how to 
approximate (LP1) in polynomial time. 

Lemma 1 t LP1 ^ 1/2 ) = 0(E[T pt]) 

The following lemma states that, in polynomial time, we can find an integral assignment that 
approximates (LP1) to within a constant factor. Some aspects of the proof are similar to [11, 
Theorem 4.1], but we add several steps that improve the approximation ratio. 

Lemma 2 There exists a polynomial-time algorithm that computes a feasible solution to LP1(J' , L) 
having value 0(ttpi(j',L))- 

Proof. We relax our integer linear program to a linear program, and then show that the relaxed 
LP can be rounded to yield an integral {x~[j} solution with value 0(T LP uji L \). 

First, let i'^ = minLfy, L}. Then we replace each iij in Equation (1) with yielding the 
constraint J2ieM ^ij x ij — € J'. Note that since assignments are restricted to be integral, 

this change has no effect on either the feasibility or the value of an assignment. Next we remove 
Equation (3) and solve the relaxed linear program. Letting {x*j,t*} be an optimal solution, we 
note that t* < iLPi(j',L)> because integral solutions are feasible. 

Our goal now is to round the LP solution to an integral solution, while not increasing its value 
by very much. We proceed in three steps. First, we group machines having similar i'^ for a job j, 
yielding a single assignment for the whole group. Then, we round those assignments to integers. 
Finally, we show that the rounded assignments satisfy (LP1), using an integral flow network. 

For each job j, we group machines having values within a factor of 2, and determine the total 
assignment to that group. More formally, for each j and integer k, we let D* k = Yli-\iog£'. \=k x ij ^ e 
the total assignment of machines with I'- G [2 fc , 2 k+1 ) to job j. It should be clear that YlieM ^ij x ij — 
Z k D* k 2 k >L/2,foT8lljeJ'. 
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We next round the value of D* k up to [QD* k \. We claim that ^2 k [6D* k \2 k ^ L > V i G J ' ■ Note 
that since i'^ p£ L, the maximum value of k having nonzero D* k is [log . Thus, the claim follows 
because Z k l6D* k \2 k > 3(2 £ fc D* jk 2 k ) - (Ek<io g L^) > 3(£) - (Ek= L/2 k ) > 3L - 2L = L. In 
other words, rounding the group assignments down to integers can only cause us to lose a log mass 
of at most 2L. We thus need an assignment giving 3L log mass to the job. 

To complete the integral assignment, we construct a network-flow instance as follows. For each 
job j and integer k, we have a node Ujk- For each machine i, we have a node V{. We also add 
a source-node s and a sink-node w. For each Ujk, we add a directed edge (s,Ujk) with capacity 
L6-D*jJ . For each Vi, we add a directed edge (vi, w) with capacity |~6£*] . Finally, we add a directed 
edge (ujk,Vi) with infinite capacity, for any j,k,i such that [log = k. Note that for a given j 
and i, there is exactly one k such that (ujk,Vi) exists. We refer to this edge as edge (j, i). 

Note that if we make the capacity of edges (s,Ujk) be 6D* fe instead, then a flow of demand 
Yljk ^Djk exists in this network. Thus, a flow of capacity Yljk L^j/J exists when we lower the 
capacity of edge (s, Ujk) to [6Z?^ fc J . 

Ford-Fulkerson's theorem [2,4] states that an integral max flow exists whenever the capacities 
are integral, as they are here. We therefore take the flow across the edges (j, i) as our integral 
assignments Xij. Moreover, by construction, x%j satisfy j Xq < [6**1 for all i £ M, and 
J2ieM ^'ij^ij — J2klP-D* k \2 k > L for all j G J. We thus have an integral feasible solution {xij,6t*}. 
Noting that 6t* < ^T LP1 iji L \ completes the proof. □ 

Recall that Lemma 1 shows that iLPi(j,i/2) = 0(E[Topt])- Then Lemmas 1 and 2 in concert 
state that in polynomial time, we can find a schedule S of length 0(E[Topt])j such that every 
job has at most a constant probability of failure. Repeating S until all jobs complete gives our 
oblivious schedule SUU-I-OBL. The proof appears in Appendix D. 

Theorem 3 Let Tguu-i-OBL denote the random variable corresponding to the amount of time it 
takes for an execution of SUU-I-OBL to complete all jobs. Then E[Tsuu-I-OBl] = 0(E[Topt] logn). 

A semioblivious 0(loglog(min{m, n}))-approximation 

We construct our semioblivious schedule SUU-I-SEM as follows. The schedule is divided into 
"rounds." The first round corresponds to an execution of the schedule suggested by the (rounded) 
solution to LP1(J, 1/2). In each following round, (LP1) is applied to all remaining jobs with 
doubling targets. If Jk C Jk-i C J are the set of jobs left at the start of round k, then for that 
round we find an approximate solution to LPl(Jk,2 k ~ 2 ), and schedule obliviously according to it. 

SUU-I-SEM runs at most K = [loglogminjm, n}~\ + 3 of these rounds. If uncompleted jobs 
remain after the Kth. round, one of two things is done. If n < m, SUU-I-SEM runs each job one 
at a time on all machines, until all jobs are completed. If m < n, SUU-I-SEM simply repeats the 
schedule ^LPi(j k ,2 k ~ 2 ) given by the Kth round until all jobs complete. 

The following theorem states that SUU-I-SEM achieves an 0(log log(min{m, n}))-approximation. 
The key aspect of our analysis is viewing SUU-I-SEM as an "online algorithm" to solve the SUU* 
problem over the hidden input {rj}. We compare the length of SUU-I-SEM's schedule against 
that of an optimal offline algorithm, called OFF, that knows the values {rj}. In particular, we 
show that if OFF takes total time t on input {rj}, then each round of SUU-I-SEM takes time 0(t) 
on the same input. This part of our proof is essentially a competitive analysis [17]. 

Theorem 4 Let K = [loglogmin{m,n}l + 3 and let Tsuu-i-SEM denote the random variable 
corresponding to the amount of time it takes for an execution of SUU-I-SEM to complete all jobs. 
Then E[T SU u-I-SEm] = 0(E[r p T ] ■ K). 
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Proof. First, we show that for any fixed set of random values {rj}, each round of SUU-I-SEM 
takes time proportional to the optimal strategy OFF. Then, we show that the expected time to 
complete the first K rounds is 0(E[Topt] ■ K). Finally, we analyze the cases when SUU-I-SEM 
does not complete in K rounds. 

Consider an optimal offline strategy OFF that knows the random values of {rj}, and let 
Toff ({^j}) denote OFF's makespan given the values {rj} (i.e., Toff({^}) is the minimum over all 
strategies for a fixed {rj})- For each k £ {2, 3, . . . , K}, let </& C J be the subset of jobs such that 
— logrj > 2 fc ~ 3 . Thus, OFF must assign machines to job j € such that J2ieM x ij^ij ^ 2 fe ~ 3 . 
Thus, we have T OF f({^}) > T LP ^ Jk 2k -zy 

Now consider an execution of SUU-I-SEM for the same {rj}. We note that if a job j remains 
uncompleted at the start of the fcth round, for k £ {2,3, ... K}, then — logr.,- > 2 fc ~ 3 . This 
inequality follows from the fact that in the (k— l)th round, we give every job a log mass that exceeds 
2 fe_3 . Hence, the jobs executed in the fcth round are a subset of those defined by above. By 
Lemma 2, the fcth round takes time 0(T L p 1( -j fc 2 '!-2))- Observing that F LPX ^j k ^k-i- ) < 2T L p 1 (j fc 2 '=-3), 
we conclude that SUU-I-SEM's fcth round takes time 0(T ff({^})). 

We thus have that for any particular {rj}, if SUU-I-SEM completes in d+ 1 < K rounds, then 
it takes total time 0(E[Topt] + dToFF({rj}))- The E[Topt] term comes from the time it takes to 
execute the first round (as in Lemma 1). Since Topt > E[Toff], where Toff is the time taken 
by OFF on a randomly selected {rj}, we conclude that the expected time for SUU-I-SEM's first 
d + 1 < K rounds is 0(E[T OPT ] ■ d) = 0(E[T OPT ] ■ K). 

We now consider the case when SUU-I-SEM has not completed after K rounds. Let Fk be 
Bernoulli r.v. that is 1 if all jobs have completed after the Kth. round and otherwise. Clearly, 
E[Tguu-i-SEM|Tft; = 1] = 0(E[Topt] • K). To complete the proof, we must show that Vv{Fk = 
0}E[Tstju-i-sem|Tx = 0] < 0(E[Topt] ■ K). We consider the remainder of the proof in two cases, 
depending on whether n < m or m < n. The case of m < n appears in Appendix D. 

Suppose n < m and Fk = 0. Recall that after the Kth round, SUU-I-SEM runs jobs one 
after the other. Recall also that remaining jobs must have — logr, > 2 K ~ 2 > 2 lo s lo s n + 1 > 21ogn. 
This event occurs when rj < 1/n 2 , which, by the union bound, happens with probability no more 
than 1/n. Running jobs one at a time is trivially an 0(n)-approximation, so we conclude that 
Pt{F k = 0}E[T S uu-I-SEm|T^ = A n < m) < 0(E[T OPT ]^). □ 

4 JOBS WITH CHAIN-LIKE PRECEDENCE CONSTRAINTS 

This section gives our 0(log(n + m) log log(min{m, n}))-approximation for SUU-C, the case when 
precedence constraints form a collection of disjoint chains. Our algorithm may be used as a 
subroutine for SUU-T, the more general case where precedence constraints form disjoint trees (see 
Appendix B). 

In SUU-C, the dependency graph G is a collection disjoint chains G = {C\, Ci, ■ ■ ■ , C z }, where 
each Cfc gives a total order on a subset of jobs. If job j\ precedes j2 in a chain, we write j\ -< ji- 

Our algorithm for disjoint chains is similar to Lin and Rajaraman's algorithm [11], but we 
achieve a better approximation ratio through various improvements. We first give an overview of 
the algorithm. We provide more details later in the section. 

To construct our schedule, we first find assignment {xij} of machines to jobs (where x\j is an 
integral number of steps for which machine i is assigned to job j), giving each job one unit of log 
mass, such that the "length" and "load" of the assignment are bounded by 0(E[Topt])- The load 
of a machine is the number of timesteps for which any job is assigned to it (i.e, ^2jXij), and the 
load of the assignment is the maximum across all machines. The length of a chain is the sum of 
the length of the jobs in the chain. The length of a job j, denoted by dj, is the maximum number 
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of steps for which j is assigned to a single machines (i.e., dj ) . Clearly, a schedule taking 

time T must have a length and load no more than T. 

We use an LP relaxation (similar to (LP1) in Section 3) to generate our assignment. Details 
appear later in the section. As in Section 3, our LP relaxation achieves an 0(l)-approximation . 
Note that this assignment does not immediately yield a schedule. 

As we transform our assignment into an adaptive schedule, we treat long and short jobs differ- 
ently. We say that a job is short if the length of its assignment is at most some value 7, to be 
defined later, and the job is long otherwise. To simplify presentation, suppose for now that all 
jobs are short. We later describe how to deal with long jobs. 

We then transform the assignment into an adaptive schedule for each chain Ck- The schedule 
Sfc considers the next eligible (uncompleted) job j in and (obliviously) schedules the next dj 
timesteps according to the assignment {x^}. Specifically, if begins executing job j at time t, 
then it schedules j from time t to t + xij on machine i. (Machine i remains idle from time t + Xij to 
t + dj.) After the dj timesteps, again considers the next eligible job in the chain (which may be 
the same job if it failed). We note that each time job j is obliviously scheduled, it has a constant 
probability of success. 

We then combine all the in a straightforward manner, yielding a "pseudoschedule" for the 
SUU-C instance, denoted by {S^}. In particular, a pseudoschedule runs all "in parallel," 
possibly assigning multiple jobs to the same machine in each timestep. To avoid confusion, we call 
each of the timesteps of a pseudoschedule a super step, and we call the number of jobs assigned 
to a single machine during a superstep t the congestion at that superstep, denoted by c(t). We 
"flatten" each superstep to c(t) timesteps by arbitrarily ordering the jobs assigned to each machine, 
thus yielding a schedule called SUU-C. If c max is the maximum congestion over all supersteps, and 
Z is the maximum length of any chain, then SUU-C comprises 0(c max Z) timesteps. 

To reduce congestion, we apply a random-delay technique [9, 16], also used by Lin and Ra- 
jaraman [11]. We also utilize the fact that when chains consist of sufficiently many (short) jobs, 
the number of supersteps spanned by is near the expected length of with high probability. 
To deal with long jobs, we run SUU-I-SEM 0(log(n + m)) times, which dominates the runtime, 
yielding the 0(log(n + m) loglog(min{m, n}))-approximation. 

Finding an assignment with low load and length. 

As in Section 3, we use an integer linear program to optimize for the constraints. This integer 
linear program for chains matches that used in [11, LP1]. 



(LP2) mint 

s-t. EieM £ ij x ij > 1 Vj G J (4) 

EjeJ x v < t VieM (5) 

Zjec k dj < t VC k eG (6) 

0<Xij < dj VieMJeJ (7) 

dj > 1 Vj G J (8) 

x^ G N U {0} Vi G M,j G J . (9) 



Equations (4), (5), and (9) correspond to Equations (1), (2), and (3), respectively, in (LP1). 
Equation (5) bounds the load of each machine. Equation (6) bounds the length of each chain, and 
Equations (7) and (8) determines the length of each job. 

The following lemma, proven in [11, Lemma 4.2], states that the optimal value for (LP2) is a 
lower bound on E[Tqpt]- 
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Lemma 5 Let t( LP2 ) be the optimal value for (LP2). Then t^ LP2 ) = 0(E[Topt])- D 

The next lemma exhibits an 0(l)-approximation to (LP2). Lemmas 5 and 6 together imply a 
polynomial-time algorithm giving an integral assignment {xij} of machines to jobs, such that the 
load and length are both 0(E[Topt])- 

Lemma 6 Let tri P2 \ be the optimal value for (LP2). There exists a polynomial-time algorithm 
that computes a feasible solution to (LP2) having value 0(t( LP2 ))- 

PROOF. The rounding proceeds as in Lemma 2, starting by removing Equation (9) and replacing 
Equation (4) by X^ieM ^ij x ij — 1> f° r = mm {^ij, 1}- The only major difference is in the capacity 
of some edges in the flow network. Instead of giving edge an infinite capacity, we restrict 
the capacity of edge (j, i) to \6al*~\ , where d* is the assignment given by the optimal solution to 
the relaxed linear program. We note that the length of a chain Ck may increase up to at most 
^jeC k d* + \C k \<7j: jeCk d*. □ 

Reducing congestion of SUU-C 

As described thus far, SUU-C may have 0(n) congestion. We take advantage of a random-delay 
technique [9, 16] to reduce congestion to O ( ip'giog (n+m) ) ' wr th high probability. Essentially, we 
modify SUU-C to simply delay the start time of each chain by a value chosen uniformly at random 
from {0, 1, ... , H}, where H is the load of SUU-C. 

The delay technique is summed up by the following theorem, proof omitted (as similar theorems 
appear elsewhere). It originates in [9], and Lin and Rajaraman [11, Section 4.1] outline the necessary 
proof as applied to SUU-C. 

Theorem 7 Consider a pseudo schedule with total load H , where H is polynomially bounded 

in n and m. Consider the pseudo schedule generated by randomly shifting, or "delaying," the 

start time of each chain schedule by a value chosen uniformly at random from {0, 1, . . . ,H}. 
Then {T,^} has congestion at most O ( lo '° fog^+ra) ) , with high probability with respect to n and m.D 

Notice that whenever the load of and length of are bounded by 0(E[Topt]), it follows that 
the length of is at most 0(E[Topt]) supersteps, with high probability. 

Since Ej, repeats the assignment for some jobs, the load and length of the pseudoschedule 
are random variables. We note, however, that the random successes and failures of jobs (and hence 
load and length) are independent of the initial random delay selected. Thus, as long as our random 
execution yields a load and length of 0(E[Topt])j then Theorem 7 implies that SUU-C consists of 

0(E[T OP t]) supersteps, for a total time of 0(E[Top T ]c max ) = 0(E[T OPT ] J g f ( g "(n+i) ) ste P s - 

The following lemma implies that most executions of SUU-C result in low load and length. In 
this lemma, yj is the random variable indicating the number of repetitions of job j's assignment used 
to complete j, and dj denotes the length of job j's assignment. In the SUU-C context, rj = n + m. 
The value W here represents the load or the length of the assignment. The lemma states that 
whenever a job has length (or causes load) that is logarithmically smaller than the total, then the 
length of the chain or (load on a machine) is close to the expectation, with high probability. Union 
bounding over all 0(n) chains (or m machines) implies that the total length (and load) of SUU-C 
schedule is close to the expectation, with high probability. The proof appears in Appendix D. 

Lemma 8 Consider yj G N drawn from the geometric distribution Pr{yj = k G N} = (l/2) k , and 
let 1 < dj < Wj log n be a weight associated with each yj for any values such that W > J2j an d 
log rj < W . Then J2j Ujdj < 0(cT) with probability at least \ — \jrf. 
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We conclude that if jobs are short, where short jobs have length at most 7 = iLP2/log(n + 
m), and if thPi is polynomial in n and m, then SUU-C takes time 0(E[Topt] lo'giog^n+m) ) with 
high probability. To get this bound in expectation, we simply modify SUU-C to run the 0(n)- 
approximation (as for SUU-I-SEM) whenever congestion, load, or length exceed the desired bounds, 
which occurs with probability at most 1/n. 

Handling long jobs 

We now extend SUU-C to handle jobs having length more than tiP2/log(ra + m). In the chain 
schedule we replace each longer job by a "pause" of length iiP2/log(n + m). Specifically, 
no job from the chain is scheduled until iLP2/log(ra + m) supersteps later. We then divide our 
schedule SUU-C into 0(log(n + m)) segments of length tLP2/log(n + m) supersteps. Note that 
by construction, there is at most one pause per chain per segment. After executing each segment, 
SUU-C executes SUU-I-SEM on the jobs corresponding to the pauses starting in that segment 
(suspending the rest of the chains until completion). Once those long jobs complete, SUU-I-SEM 
continues to the next segment. 

All of our previous analyses (that assume jobs are short) still hold. In particular, we satisfy the 
requirement that all long relevant long jobs complete before the short jobs are scheduled again. Since 
there are 0(log(n+m)) executions of SUU-I-SEM, it follows that the total expected time increases to 
O (E [Topt]- log (n+m) loglog(min{m, n})), and hence we have an 0(log(n+m) log log(min{m, n}))- 
approximation. 

Extending to nonpolynomial t^P2 

We now address the requirement in Theorem 7 that load and length be polynomially bounded in n 
and m. We make use of a trick from [16, Section 3.1], also used in [11]. Consider the chain schedule 
Sfc (having length 0(tLP2), with high probability) before the random delay is applied. We round 
each assignment Xij down to the nearest multiple of tLP2/nm. We thus treat the assignments as 
integers in the range {0,1, ... ,0(nm)}. We can then apply the random-delay technique (from 
Theorem 7) to these rounded assignments. 

The issue now is that the rounding may have decreased many assignments, so we reinsert steps 
into the schedule. In particular, whenever executing job j, we reinsert steps (not supersteps) into 
the execution, executing only job j during those steps. Specifically, the execution of job j may 
result in reinserting at most an expected 2tLP2/nm steps for each machine, and hence 2tiP2/n 
steps in total. Summing across all n jobs gives an expected 2tpp2 steps, thereby increasing the 
total length of SUU-C by 0(E[T pt]). 

Theorem 9 Let Tsuu-C denote the random variable indicating the time at which an execution of 
SUU-C completes all jobs. Then E[Tguu-c] = 0(EP~opt] ■ log(n + m) log log(min{m, n})). □ 

5 CONCLUSION 

In this paper, we have presented improved approximation algorithms for multiprocessor scheduling 
under uncertainty. We believe that our bounds our not tight. In particular, we believe that a 
fully adaptive schedule should be able to trim an 0(loglog(min{m, n})) factor from our bounds. 
It would also be interesting if a greedy heuristic could achieve the same bounds. Finally, we would 
be interested in developing nontrivial approximations for more general precedence constraints. At 
first glance, however, it seems like any technique for SUU and arbitrary precedence constraints may 
generalize to R\pmtn, prec\C max , which remains unsolved. 
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A PROBLEM REFORMULATION 



This section presents a full description of our reformulation of the SUU problem. We use this new 
formulation to simplify both our algorithms and the analyses involved. 

To disambiguate between the original statement of the SUU problem given in Section 2 and the 
new one described herein, in this section we refer to the new formulation as SUU*. Since we show 
that they are equivalent, we refer to both problems as SUU later in the paper. 

As with SUU, an SUU* instance includes a set J of jobs, a set M of machines, and a set of 
precedence constraints forming a dag. Also as before, the instance specifies a real q^ € [0, 1] for 
each job j and machine i. In SUU, q^ specifies a failure probability. In SUU*, however, we do not 
view q^ as a probability; instead we view 1^ = — log q^ as an amount of "work" that a machine 
does towards a job completion in each unit timestep. As in SUU, machines must be scheduled at a 
unit granularity. 

We model the stochastic nature of the problem in SUU* by associating with each job j a single 
random variable rj chosen uniformly at random from the (0, 1) interval. We say that a job j 
completes once the total work done (or log mass accrued) on j exceeds — log rj . More formally, let 
Mj :t be the set of the machines assigned to job j by a schedule on time t. Then j completes during 

the first step t in which Y? k =i ^ieM jtk > ~ lo g r j> or equivalently njLi HieM jik Qij ^ r r 

Note that a schedule £ is oblivious to the random value rj. Instead, it is only aware of whether 

a job completes in each timestep. Thus, a schedule must make its decisions for assignments in 

step t based only on the surviving sets of jobs Si, S2, ■ ■ ■ , St for each of the preceding timesteps. 

Hence, the same schedule may be applied to both SUU and SUU*. 

The following theorem states that SUU and SUU* have the same distribution over states of 

uncompleted jobs in each timestep. 

Theorem 10 Consider executions X and X* of schedule £ on SUU-instance I = (J, M, {qij} , G) 
and corresponding SUIT -instance I* = (J, M , {qij} , G) , respectively, that run for t — 1 timesteps. 
We define the history of the execution X (and X* ) after t — 1 steps, denoted by h t (and hi), as a 
sequence of job subsets h t = (Si, S2, ■ ■ ■ , St), where S^ C J is the set of uncomplete jobs remaining at 
the start of step k in the execution. For any state (Si, S2, ■ ■ ■ , St), we have Pr {ht = (Si, S2, ■ ■ ■ , St)} = 
Pv{h* t = (Si,S 2 ,...,S t )}. 

PROOF. By induction on time t. Initially, Pr{/ii = (J)} = Pr{h\ = (J)} = 1. 

Suppose h t = h* = (Si, S2, ■ ■ ■ , St). Then S makes the same decisions for assigning machines to 
jobs in step t in both executions (i.e., T,(h t , t) = T,(h*,t)). Let M J)t be the set of machines assigned 
to j by S(/if,t). Let St+i and <S£ +1 denote the random variables indicating the subsets of jobs 
remaining after executing step t in X and X* , respectively. We will show that St+i and S% +1 have 
the same distribution. 

For each job j G St, the probability that X does not complete j in step t of the SUU execution X 
is given by Pr{j G S t+ i\h t } = UieM 3 , t Qij- 

We now consider the probability that X does not complete j £ St in step t of the SUU* 
execution X* . By definition, j completes if n*;=i IlieAf,- fc 9»j — r r By assumption, since j £ St, 

we have nl=\ UieM iik Hi > r r Thus ' 

Pr{jes; + il^*} 



Pr < 



7\- < 



n n 



Qij 



k=l i£M 



t-1 

<n n 



Qij 



k=li€M 



3,k 



Pr < 



t-i 



<[ II I 11 n m 

^ieM jit J \k=lieM jtk 



t-1 



7\- < 



n n * 



,k=l ieM. 



3,k 
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= Pr < 



Ti < 



n «j 



r 3 < 1 



= n 

ieM jtt 

Multiplying probabilities Pr{j G St+i|/it} for each job j G S 1 C St and 1 — Pr{j G <Sf+i| /it} for 
eachjob j G S t — S yields Pr{S t+ i = S\h t } = Pr{5 i * +1 = S\h*}. Applying the inductive hypothesis 
(i.e., Pr{/i t } = Pr{/i*}) completes the proof. □ 



Corollary 11 Let and be the random variables denoting the amount of time it takes to 
execute SUU and SUlf instances, respectively, using schedule S. Then E [Ts] = E[T£]. Thus, a 
schedule that gives an a- approximation to SUU* also gives an a -approximation to SUU. 



B JOBS WITH TREE-LIKE PRECEDENCE CONSTRAINTS 

We can obtain algorithms for tree-like precedence constraints by trivially applying techniques 
from [7], as done in [11]. We state the bound here without proof. 

When precedence constraints form a directed forest, the technique from [7] decomposes the 
graph into O(logn) blocks, each consisting of disjoint chains. We then apply SUU-C O(logn) 
times. 

Theorem 12 If precedence constraints form a directed forest, there exists a polynomially com- 
putable schedule with expected makespan 0(E [Topt] ■ log(ro) log(n + m) loglog(min{m, n})). 

C STOCHASTIC SCHEDULING 

This section shows how our algorithms from Sections 3 and 4 apply to the problem of preemptively 
scheduling jobs whose lengths are given by random variables on unrelated parallel machines. Specif- 
ically, we give polynomial time algorithms for problems of the form R\pmtn, prec, pj~stoch\E [C max ] 
with approximation ratios for each type of precedence constraint identical identical to those of pre- 
vious sections, so long as job lengths are drawn from exponential distributions with known means. 

First, we review the stochastic model and how it differs from SUU. Then, we overview an 
0(loglogn)-approximation for R\pmtn,pj~stoch\E [C max ], which we designate STOCH-I. Finally, 
we discuss briefly how to generalize to cases with precedence constraints. 

Preliminaries 

An instance I s toch = (J, M, {Xj} , {vij}) of STOCH-I contains a set of jobs J and a set of machines M 
just as in SUU. For every job j, Xj specifies the rate parameter of the exponential distribution from 
which j's length, denoted by the random variable pj, is drawn. That is, Pr{pj < c} = 1 — e~ cAj . 
Only Xj is given in an instance, and pj is not revealed until the job completes. Finally for each 
machine i and job j, Vij specifies the speed with which machine i processes job j. Specifically, 
let Xij be the amount of time during which machine i processes job j. Then j completes once 
Y^i x ij v ij ^ Pj- This inequality should look very similar to SUU*, which is why our earlier algorithms 
apply here. 

In STOCH-I, Xy need not be integral, but we do require that no job be processed by more than 
one machine at the same time. 

We continue to refer to optimal algorithms by OPT and (the random variable denoting) the 
time they take to run by Topt- 
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An O(loglogn)-approximation for STOCH-I 

We now show how to provide a 0(loglogn)-approximation for STOCH-I, using arguments very 
similar to those in Section 3 and a constant-factor approximation algorithm for R\pmtn\C max . 

Our algorithm STC-I operates similarly to SUU-I-SEM. In particular, STC-I operates in K = 
[log log n + 3] rounds, each corresponding to an oblivious schedule We construct the oblivious 
Sfc such that any job having (stochastically chosen) pj < 2 k ~ 2 /\j completes. Specifically, 
corresponds to (approximately) solving the deterministic analog R\pmtn\C max , setting the length 
of job j to 2 k ~ 2 /\j. Any jobs remaining after the end of these K rounds is run one at a time on 
the fastest possible machine. 

We use the algorithm from Lawler and Labetoulle [8] to compute an 0(l)-approximation for 
R\pmtn\C max in polynomial time, giving us each of our S^. 

The following theorem states that STC-I approximates STOCH-I. Proof (omitted) is similar to 
Theorem 3 and Lemma 1 

Theorem 13 Let Tstc-i be the random variable denoting the time it takes for an execution of 
STC-I to complete all jobs. Then E [T ST c-i] = 0(E [T pt])- 

Proof Sketch. The full proof includes a component similar to Lemma 1, showing that the first 
round (solving for deterministic lengths l/(2Aj)) approximates E[Topt]> and a component similar 
to Theorem 3, using an offline-algorithm argument to prove that the K — 1 subsequent rounds take 
expected time 0(E [Topt] ■ K). 

To complete the proof, we note that when pj are bounded above by 2 log n/Xj, then all jobs com- 
plete during (or before) T,k- Since thepj are exponentially distributed, Pr {3j s.t. pj > 2 logn/Aj} < 
1/n. Running jobs sequentially is an n-approximation, but this sequential run occurs only with 
probability at most 1/n. □ 

A virtually identical algorithm gives an 0(loglog(n))-approximation to the slightly weaker 
setting, R\ restart, pj^stoch | E [C max ]. In this setting, a job must run fully on a single machine, but 
it may be restarted on a different machine. Here, job lengths are stochastically chosen only once. 
The only necessary change to the algorithm is substitute the fcth round with the corresponding 
solution to R\\C max , in lieu of R\pmtn\C max . 

Other results 

Similar analysis yields an 0(loglogm)-approximation for STOCH-I. Substituting assignments gen- 
erated from algorithms for R\pmtn, chains\C max [10] for those given by (LP2) gives an 0(log(n + 
m) log log(min {m, n}))-approximation when precedence constraints form chains, and an 0(log(n) log(n+ 
m) log log(min {m, n}))-approximation when they form directed forests, using the same algorithms 
and techniques. 

D PROOFS 

Lemma 1 i L pi(j,i/2) = 0(E[T OPT ]) 

Proof. Let t be the optimum solution to LP1(J, 1/2). Consider any subset U C J, and its com- 
plement U. Then LP1(U, l/2) + LPl(U, 1/2) > t, since we can construct a solution to LP1(J, 1/2) 
by adding a solution to LP1(U, 1/2) and a solution to LP1(U, 1/2). 

Now recall our view of the problem in terms of SUU*: there is an rj chosen uniformly at random 
from (0, 1) for each job j such that job j completes only if J2ieM ^ij x ij ^ — logrj. For any sample 
from the event space, let U be the set of jobs j for which rj > 1/2, and let U be the complement 
set of jobs j for which rj < 1/2 (note rj = 1/2 with probability so can be ignored). By definition, 
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each job is in U independently with probability 1/2. Next observe that however OPT generates its 
schedule, it must allocate at least 1/2 unit of "work" to each job in U; in other words, Equation (1) 
of (LP1) must hold for every j G U . Thus, the optimum schedule contains a feasible solution to 
LPl(U,l/2). 

Now observe that by construction, U is a uniformly random subset of J, meaning all subsets 
are equally likely. Thus, 



E[T OP t] = 2- n Y,E[T OP T | U] 

u 

= 2 ~ H ■ ^(E ^[ T OPT I U \ + E ^ T OPT I U]) 

u u 

> 2-"l ^2(LP1(U, 1/2) + LP1(U, 1/2)) 

u 

> 2-"^LPl(J,l/2) 



u 

= hpi(j,i/2) 

Where the second line of this derivation follows from the first paragraph of this proof. □ 

Theorem 3 Let Tguu-i-OBL denote the random variable corresponding to the amount of time it 
takes for an execution of SUU-I-OBL to complete all jobs. Then E [Tstju-I-OBl] = 0(E [Topt] logn). 

Proof. From Lemmas 1 and 2, we have a schedule £ of length 0(E [Topt]) that gives each job a 
constant probability of success. Applying a Chernoff bound gives us that a particular job completes 
in O(logn) repetitions of S, with probability at least 1 — l/n®^ 1 ), where the constant exponent 
appears as a constant factor in the number of repetitions. Taking a union bound over all jobs 
gives that with probability at least 1 — l/n e( ~ l \ all jobs complete in O(logn) repetitions. Since this 
probability drops off dramatically as the number of repetitions increases, we have E [Tsuu-I-OBl] = 
0(E [Tqpt] logn). ' □ 



Theorem 4 Let K = [loglogmin{m, n}] + 3 and let Tsuu-I-SEM denote the random variable 
corresponding to the amount of time it takes for an execution of SUU-I-SEM to complete all jobs. 
Then E [T SU u-I-SEm] = 0(E [T OPT ] ■ K). 

PROOF. This proof resumes where the same proof from Section 3 leaves off. In particular, we 
show here that SUU-I-SEM produces an 0(loglogm)-approximation by showing that repeating the 
([log log m] + 3)rd round until completion takes time 0(E [Topt]) • 

Suppose that m < n and Fk = 0. Let = ^LPi(j K ,2 k ~ 2 ) be the schedule computed at the 
end of the Kth phase. Here, SUU-I-SEM repeats T,k until all jobs complete. Define the load H 
of a finite schedule to be the maximum number of timesteps during which any machine is assigned 
to an uncompleted job (i.e., H = maxj ^ ■ Xij); a schedule can be compressed to run in exactly H 
time-steps. We will analyze how long it takes for the load of our instance to drop from its initial 
expected value 0(E [Topt]) (at the end of the iTth phase) to (when all jobs have completed). 

Define X as a random variable denoting the number of timesteps until the compressed load 
of T,K drops to 0, and let Tx be the (random variable) denoting the length of Ex- Then X = 
Yli=v K X{TkI2 % = Tk Yl<i=o K Xi/2" 1 , where Xj is the random variable representing the number 
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of repetitions of necessary to drop its load from Tk/2 1 to Tk/2 %+1 . By construction, a single 
execution of T>k ensures that each job remains with probability at most \/m? , and thus the expected 
load of each machine shrinks by at least a (multiplicative) factor of m 2 . Markov's inequality gives 
us that each machines load decreases by a a factor of m/2. with probability at least l/2m. Taking 
a union bound over all machines gives us that the load of all machines decreases by a factor of 
m/2. with probability at least 1/2. 

For m > 4, we now have that each repetition of decreases the remaining load by a fac- 
tor of 2 with probability at least 1/2, and hence E [Xj\ < 2 for all i — requiring only an ex- 
pected constant number of repetitions to reduce the load by a constant factor. We now have 
that E fe!= g o TK Xi/2*\ < £*=o E PQ] /2* < 0(1). Since each X t and T K are independent, it fol- 
lows that E [X] < E [T K ] J2Zo E \ X i\ l 2i = 0(E [T K ]). Having E [T K ] < 0(E [T OPT ]) completes the 
proof. □ 

Lemma 8 Consider yj G N drawn from the geometric distribution Yr{yj = /c G N} = (l/2) fe , and 
Zei 1 < cZj < Wj log rj be a weight associated with each yj for any values such that W > 2dj and 
log 77 < W. Then ■ Vjdj < 0(cT) with probability at least 1 — l/rj c . 

Proof. Round all the dj up to the next power of 2. Let Z k be the set of j such that T / (2 k log rj) < 
dj < T j (2 k ~ 1 log rj), for k G {1,2,..., log(T/ log rj)}. We apply a Chernoff bound over all jobs in 
Z k to show that the weighted sum of their yj is near the expectation, with high probability. In 
particular, let B k be the sum bk (to be decided later) Bernoulli random variables with probability 
1/2. Then clearly Prji^. < \Z k \} = Pr |^j g ^ fe Vj > We therefore apply Chernoff bounds to 
the B k . We will then union bound over all B k to complete the proof. 

Since there are potentially many nonempty Z k sets, we choose the bk so as to give geometri- 
cally decreasing failure probabilities. In particular, we set bk = a(c)(\Zk\ + log?? + k) for some 
constant a(c), with (1 — l/a(c)) 2 /2 = c. Then a Chernoff bound states that Pr{B k < \Zk\} < 
e -(io g r,+fc)(i-i/ Q ( c )) 2 /2 < „-c e -cfc_ Taking a union bound over all k gives Pr{any5 fc < \Z k \} < 

^fc°=i T//l0g ^ r]~ c e~ ck < r]~ c Y^k=\ e ~° k — T I~ C when c > log 2 e. It follows that Pr |any^ j6Zfc yj > b^ < 
r]~ c . And hence with probability at least 1 — l/r/ c , we have ^ • djyj < ^2 k {T/(2 k log 77) X}jez fc Vj) — 
T. k (Tb k /{2 k \ogn)) < 0(cE,-dj) + (T/ log 77) £ fc (log r? + fc)/2 fc = 0(c£, d,) + OCH = O(cT). □ 
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